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ExPASy Home page Site Map Search ExPASy Contact us Proteomics tools Swiss-Prot 

»El for [ clostripain 



Search 



Swiss-Prot/TrEMBL 



Welcome to the SIB BLAST Network Service 



If results of this search are reported or published, please mention thatl 
the computation was performed at the SIB using the BLAST network service. 
The SIB BLAST network service uses a server developed at SIB and the NCBI^ 
BLAST 2 software. 




In case of problems, please read the online BLAST help . 

If your question is not covered, please contact < helpdesk@expasy . orq > . 

NCBI BLAST program reference [PMID : 9254694 ] : 

Altschul S.F., Madden T.L., Schaffer A. A. , Zhang J., Zhang Z., Miller W., 
Lipman D.J. Gapped BLAST and PSI-BLAST: a new generation of protein 
database search programs. Nucleic Acids Res. 25:3389-3402(1997). 



Query: 3 0 AA 

Date run: 2005-10-27 21:09:24 UTC+0100 on sib-gml . unil . ch 

Program: NCBI B LAS TP 1 . 5 . 4 -Paracel [2003-06-05] 

Database: EXPASY/UniProtKB 

2,482,761 sequences; 813,543,939 total letters 

UniProt Knowledgebase Release 6.3 consists of: 
UniProtKB/Swiss-Prot Release 48.3 of 25-Oct-2005: 196277 entries 
UniProt KB/TrEMBL Release 31.3 of 25-Oct-2005: 2273976 entries 



List of potentially matching sequences 



Send selected seque nces to | Clustel W ^ (multiple alignment) 



□ Include query sequence 



Db AC 

□ tr Q8 3U44 

□ tr Q4ZFT7 

□ tr Q4ZFT6 

□ tr 08626 4 

□ tr Q93MDQ 

□ tr Q4ZFT4 

□ tr Q50NG8 
C tr Q72MW1 

□ tr Q4AAS7 



Description 



Score E-value 



_CL0PE 
_CLOPE 
_CLOPE 
_CL0PE 
_CLOPE 
_CLOPE 
_ENTHI 
_LEPIC 
MYCHJ 



Beta2-toxiri [cpb2] [Clostridium perfringens] 9j3 2e-19 

Beta2 toxin [Clostridium perfringens] 9J5 2e-19 

Beta2 toxin [Clostridium perfringens] _95 2e-19 

Beta 2 toxin precursor [Clostridium perfringens C] 95 2e-19 

Beta2-toxin [cpb2] [Clostridium perfringens] 89 le-17 

Beta2 toxin [Clostridium perfringens] £T8 3e-17 

Receptor protein kinase, putative [442.t00003] [...48. 3e-05 

Hypothetical protein [LIC13078] [Leptospira inte. . . 36 0.083 

Signal peptidase I (EC 3.4.21.89) [sipS] [Mycopl... 35 0.20 
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□ tr Q5HX88 _CAMJR Cytolethal distending toxin, subunit C [cdtC] [C. . . 35 0.27 

□ tr Q46102 _CAMJE CdtC (Cytolethal distending toxin C) [cdtC] [Cam... 35 0.27 
W D tr Q7B3S5 _CAMJE CdtC protein (Fragment) [cdtC] [Campylobacter je... 35 0.27 

□ tr Q5F1K4 _CAMJE Cytolethal distending toxin C [cdtC] [ Campy 1 obac .. . 35 0.27 

□ tr 033865 _BACPU Plasmid pSH1452, Rep [Bacillus pumilus (Bacillus... 35 0.27 
D sp P04660 HEMA_IADNY Hemagglutinin precursor [Contains: Hemagglu. . . .34 0.48 

□ tr Q81L65 _BACAN Iron compound ABC transporter, iron compound -b in. . . 34_ 0.48 

□ tr Q894Q1 _CL0TE Membrane associated protein [CTC01484] [Clostrid. . . 33 0.87 

□ tr Q4A8Y6 MYCH7 Signal peptidase I (EC 3.4.21.89) [sipS] [Mycopl . . . 33 0.87 

□ tr Q54 JE1 _DICDI Hypothetical protein [DDB0187788] [Dictyostelium. . . 32 1.2 

□ tr. Q52TF1 CAMJE Cytolethal distending toxin C (Fragment) [cdtC] ... 32 1.6 

□ tr Q52TF0 _CAMJE Cytolethal distending toxin C (Fragment) [cdtC] ... 32 1.6 
IJ sp P45963 ACR7_CAEEL Acetylcholine receptor, alpha-type subunit ... 3J2 2.1 

□ tr Q8EZK0 LEPIN Hypothetical protein [LA3853] [Leptospira interr. . . 32 2.1 

□ tr Q731T5 _BACC1 HlyC domain protein [BCE4080] [Bacillus cereus (...32 2.1 
C tr Q41GQ2 9BACI Binding-protein-dependent transport systems inne... 32^ 2.1 

□ tr 059092 _PYRH0 Hypothetical protein PH1367 [PH1367] [Pyrococcus . . . 32 2.1 
[j tr Q8TID2 _METAC Predicted protein [MA4221] [Methanosarcina aceti... 32 2.1 
D sp Q98KBO MRAY_RHIL0 Phospho-N-acetylmuramoyl -pentapeptide -trans .. . 31 2.8 

□ tr Q8AWN0 _9SMEG Rhodopsin (Fragment) [Rhod] [Spinachia spinachia] 31 2.8 

□ tr Q7YTK2 _CAEEL Hypothetical protein [W01D2.6] [Caenorhabditis e... 31 2.8 

□ tr Q4IBS5 GIBZE Hypothetical protein [FG05333.1] [Gibberella zea... 31 2.8 

□ sp P87245 KMS1_SCHP0 Karyogamy meiotic segregation protein 1 [km... 31_ 3.8 

□ tr Q897V9 _CL0TE Hypothetical protein [CTC00614] [Clostridium tet . . . 31 3.8 
|J tr Q 4 88 C5 C0LP3 Arylsulf atase (EC 3.1.6.1) [atsA] [Colwellia psy. . . 31 3.8 

□ tr Q4YW58 PLABE Hypothetical protein [PB000503 . 02 . 0] [Plasmodium... 31 3.8 

□ tr Q4XSE9 _PLACH Hypothetical protein [PC000754 . 03 . 0] [Plasmodium... 31 ' 3.8 

□ tr 045280 _CAEEL Hypothetical protein srbc-83 [srbc-83] [Caenorha. . . 31 3.8 

□ tr Q6CS24 _KLULA Similar to sp|P25371 Saccharomyces cerevisiae YC... 31 3.8 
n tr Q3T6M7 _9PERO Rhodopsin (Fragment) [Archoplites interruptus] 30 5.1 
C tr Q3T6M3 _9PER0 Rhodopsin (Fragment) [Enneacanthus obesus] 30. 5 - 1 

□ tr Q3QJ06 _9GAMM Protein-disulf ide reductase precursor (EC 1.8.1.... 30 5.1 

□ tr Q3NFU5 _9HELI Cytochrome c, class I precursor [TmdenDRAFT_13 75 . . . 30 5.1. 

□ tr Q5ZGC8 _9VIRU Transmembrane glycoprotein [PHGllb_49] [Bacterio... 30 5.1 

□ tr Q61RZ6 _CAEBR Hypothetical protein CBG06382 [CBG06382] [Caenor. . . 30 5.1 

□ tr Q6YMD9 9HEMI Cytochrome oxidase subunit I (Fragment) [COI] [L. . . 30 5.1 

□ tr Q50SK5 ENTHI Hypothetical protein [256.t00009] [Entamoeba his... 30 5.1 

□ tr Q8U1W2 _PYRFU Hypothetical protein PF1092 [PF1092] [Pyrococcus... 30 5.1 

□ tr Q91F76 IRV6 450L [Chilo iridescent virus (CIV) (Insect irides... 30 . 6.8 

□ tr Q8QLT2 _9INFA Non structural protein 1 [NS1] [Influenza A viru. . . 30 6.8 

□ tr Q8Y8R0 _LISMO Putative peptidoglycan bound protein (LPXTG moti . . . 30 6.8 

□ tr Q6HR7 6 _BACAN Hypothetical protein [BAS4921] [Bacillus anthracis] 30 6.8 

□ tr Q4EVA3 _LISMO Cell wall surface anchor family protein [LM0f685... 30 6.8 
tr Q6C7M8 YARLI Similar to sp|P38778 Saccharomyces cerevisiae Tr. . . 30 6.8 

C tr Q4SLL1 JTETNG Chromosome 15 SCAF14556, whole genome shotgun se... 29 9.1 
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□ 


tr 


Q3T8J2 


_9NID0 


Polyprotein lab [pollab] [Breda virus] 


29 


9 


1 


□ 


tr 


Q3T8 Jl 


_9NID0 


Replicase [ORFla] [Breda virus] 


29 


9 


1 


D 


tr 


Q81LL9 


_BACAN 


Iron compound ABC transporter, iron compound-bin. . 


• 29 


9 


1 


1 ! 


tr 


Q6HDF6 


_BACHK 


Iron compound ABC transporter, iron compound-bin.. 


. 29 


9 


1 


□ 


tr 


Q634H5 


_BACCZ 


Iron compound ABC transporter, iron compound-bin. . 


• 29. 


9 


1 


n 


tr 


Q8R7D3 


__THETN 


Membrane-associated lipoprotein involved in thia.. 


. 29 


9 


1 


□ 


tr 


Q818A5 


_BACCR 


Ferrichrome -binding protein [BC4363] [Bacillus c. 


. 29 


9 


1 


□ 


tr 


Q73K80 


_TREDE 


FMN-binding domain protein [TDE2340] [Treponema . . 


. 29 


9 


1 


□ 


tr 


Q730G6 


_BACC1 


Iron compound ABC transporter, iron compound-bin. . 


• 29 


9 


1 


□ 


tr 


Q72R00 


_LEPIC 


Hypothetical protein [LIC11952] [Leptospira inte.. 


. 29 


9 


1 


n 


tr 


Q4MRX0 


_BACCE 


Ferrichrome-binding protein [BCE_G9241_4421] [Ba . . 


. 29 


9 


1 


□ 


tr 


Q4HL84 


_CAMLA 


Hypothetical protein [CLA04 84] [Campylobacter la.. 


. 29^ 


9 


1 


□ 


tr 


Q08858 


_KLEPN 


Fimbrial adhesin precursor [fimH] [Klebsiella pn. . 


• 29 


9 


1 


n 


tr 


Q7RJ39 


_PLAYO 


Hypothetical protein [PY03425] [Plasmodium yoeli.. 


. '29 


9 


1 



Graphical overview of the alignments 

Hff^SHS^H to resubmit your query after masking regions matching PROSITE profiles 
"™"*™ or Ffam HMMs 

(f> Help ) (use ScanProsite for more details about PROSITE matches) 



Profile hits mmiei^k^estsse^^ 
Pfan hits 
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Submission 




993H88 
4ZFT4 
5BNG8 
J72KM1 
kflfiS7 
35HX88 
346192 
57B3S5 
J5F1K4 

HEHJUIHDHY 
Q81L65 




El 
52TF1 
352TF8 
ACR^CAEEL 
Q8EZK8 
Q731T5 
Q4IG 



TI02 



i7YTIC2 
14XBS5 
CHSi-SCHPO 




J6CS24 
J3T6H7 
TT6H3 

If? 6 
J5ZGC8 

38U1M2 

LT2 
K76 

cths 

J3T8J2 
3T8J1 

ISH0F6 
3634H5 
H7D3 
81885 
I73IC88 



Hatches on query sequence 



1 



Hat 

1 




t"" ..J*. - 





S3 




ISO 



I7RJ39 



Subnission 



Identity 8 25 58 79 188% 



Alignments 
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tr Q8 3U4 4 

Q83U44_CLOPE 



Beta2-toxin [cpb2] [Clostridium perfringens] 265 AA 



align 



Score = 94.8 bits (216), Expect = 2e-19 
Identities = 29/30 (96%), Positives = 29/30 (96%) 

Query: 1 MKKIISLFTVIFMFSCFLIVGAISPMKASA 30 

MKKIIS FTVI FMFSCFLI VGAI SPMKASA 
Sbjct: 1 M KKI I SKFTV I FMFSCFLI VGAI SPMKASA 30 



tr Q4ZFT7 Beta2 toxin [Clostridium perfringens] 265 AA 

Q4ZFT7_CLOPE 



Score = 94.8 bits (216), Expect = 2e-19 
Identities = 29/30 (96%) , Positives = 29/30 (96%) 

Query: 1 MKKIISLFTVIFMFSCFLIVGAISPMKASA 30 

MKKIIS FTVI FMFSCFLIVGAI SPMKASA 
Sbjct: 1 MKKI I SKFTV I FMFSCFLI VGAI SPMKASA 30 



tr Q4ZFT6 Beta2 toxin [Clostridium perfringens] 265 AA 

Q4ZFT6_CLOPE 



Score = 94.8 bits (216), Expect = 2e-19 
Identities = 29/30 (96%) , Positives = 29/30 (96%) 

Query: 1 MKKIISLFTVIFMFSCFLIVGAISPMKASA 30 

MKKIIS FTVI FMFSCFLIVGAI SPMKASA 
Sbjct: 1 MKKI I SKFTVI FMFSCFLI VGAI SPMKASA 30 



tr 086264 Beta 2 toxin precursor [Clostridium perfringens C] 265 AA 

086264 CLOPE 



Score = 94.8 bits (216), Expect = 2e-19 
Identities = 29/30 (96%), Positives = 29/30 (96%) 

Query: 1 MKKIISLFTVIFMFSCFLIVGAISPMKASA 30 

MKKIIS FTVIFMFSCFLIVGAISPMKASA 
Sbjct: 1 MKKI I SKFTVI FMFSCFLI VGAI SPMKASA 30 



tr Q93MD0 Beta2- toxin [cpb2] [Clostridium perfringens] 265 AA 

Q93MD0 CLOPE 



align 



align 



align 



align 



http://www.expasy.org/cgi-bin/blast.pl 



10/27/05 



UniProtKB/Swiss-Prot entry P09870 [CLOSJXOHI] Alpha-clostripain 



Page 1 of 4 



fill ExPASy Home page Site Map 
Search |Swiss-Prot/TrEMBL 



Search ExPASy 
for j clostripain 



Contact us 



Swiss-Prot 



UniProtKB/Swiss- 
Prot entry P09870 



[Entry info] [Name and origin] [References] [Comments] [Cross-references] [Keywords] 

[Features] [Sequence] [Tools] 

Note: most headings are clickable, even if they don't appear as links. They link to the user manual or other documents. 
Entry information 



Entry name 

Primary accession number 
Secondary accession number 
Entered in Swiss-Prot in 
Sequence was last modified in 
Annotations were last modified in 
Name and origin of the protein 
Protein name 
Synonyms 



CLOS_CLOHI 
P09870 

P09869 

Release 10, March 1989 
Release 26, July 1993 
Release 49, January 2006 



Contains 

Gene name 

From 

Taxonomy 



Alpha-clostripain [Precursor] 
EC 3.4.22.8 
Clostridiopeptidase B 
Alpha-clostripain light chain 
Alpha-clostripain heavy chain 

Name: cloSI 

Clostridium histolyticum [TaxID: 1498] 

Bacteria; Firmicutes; Clostridia; Clostridiales; Clostridiaceae; 
Clostridium. 

References 

[1] NUCLEOTIDE SEQUENCE [GENOMIC DNA]. 

DOI=10.1007/BF00276893; PubMed=8341259 [NCBI, ExPASy, EBI, Israel, Japan] 
Dargatz H., Diefenthal T., Witte V., Reipen G., von Wettstein D.; 

"The heterodimeric protease clostripain from Clostridium histolyticum is encoded by a single 
gene."; 

Mol. Gen. Genet. 240:140-145(1993). 
[2] PROTEIN SEQUENCE OF 5 1 - 1 8 1. 

PubMed=6391922 [NCBI, ExPASy, EBI, Israel, Japan] 
Gilles A.M., Lecroisey A., Keil B.; 
"Primary structure of alpha-clostripain light chain. 11 ; 
Eur. J. Biochem. 145:469-476(1984). 

[3] PRELIMINARY PROTEIN SEQUENCE OF 51-73 AND 191-232. 
PubMed=6337850 [NCBI, ExPASy, EBI, Israel, Japan] 
Gilles A.M., de Wolf A., Keil B.; ' 

" Amino-acid sequences of the active-site sulfhydryl peptide and other thiol peptides from the 
cysteine proteinase alpha-clostripain."; 
Eur. J. Biochem. 130:473-479(1983). 

Comments 
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• FUNCTION: Cysteine endopeptidase with strict specificity. 

• CATALYTIC ACTIVITY: Preferential cleavage: Arg-|-Xaa, including Arg-|-Pro bond, but not 
Lys-|-Xaa. 

• SUBUNIT: Heterodimer of a light chain and an heavy chain held together by strong noncovalent 
forces rather than by intramolecular disulfide bridges. 

• SIMILARITY: Belongs to the peptidase CI 1 family [view classification]. 

• DATABASE: NAME=Worthington enzyme manual; WWW="http://www.worthington- 
biochem.com/CP/" . 

Copyright 

This Swiss-Prot entry is copyright. It is produced through a collaboration between the Swiss Institute of Bioinformatics and 
the EMBL outstation - the European Bioinformatics Institute. There are no restrictions on its use as long as its content is in 
no way modified and this statement is not removed. 

Cross-references 
EMBL 



[EMBL / GenBank / DDBJ] 
[CoDingSequence] 



PIR 

MEROPS 
InterPro 

Pfam 

ProDom 

HOGENOM 

BLOCKS 

ProtoNet 

ProtoMap 

PRESAGE 

DIP 

ModBase 

SWISS- 
2DPAGE 

UniRef 



X63673;CAA45212.1;-; 
GenomicDNA. 
A29174; A29174. 
A29175; A29175. 
B29175;B29175. 
S35190; S35190. 
C11.001; -. 

IPR005077; Peptidase_C 11. 

Graphical view of domain structure. 

PF03415;Peptidase_Cll; 1. 

Pfam graphical view of domain structure. 

[Domain structure / List of seq. sharing at least 1 domain] 

[Family / Alignment / Tree] 

P09870. 

P09870. 

P09870. 

P09870. 

P09870. 

P09870. 

Get region on 2D PAGE. 

View cluster of proteins with at least 50% / 90% / 100% identity. 



Keywords 

Direct protein sequencing; Hydrolase; Protease; Signal; Thiol protease; Zymogen. 
Features 



i®0| 







Feature table viewer 



- « 



Feature aligner 



Key 


From 


To 


Length 


SIGNAL 


1 


27 


27 


PROPEP 


28 


50 


23 


CHAIN 


51 


181 


131 


PROPEP 


182 


190 


9 


CHAIN 


191 


526 


336 



FTId 



Description 

Potential . 

Potential. * PRO_0000028511 

Alpha-clostripain light chain. PRO_0000028512 

Linker. PRO_000002 8513 

Alpha-clostripain heavy chain. PRO_0000028514 



http://www.expasy.org/uniprot/P09870 
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ACTJSITE 


231 


231 






CONFLICT 


127 


127 


R -> 


NQL (in Ref . 2) . 


-CONFLICT 


176 


179 


HGGG 


-> GDGH (in Ref. 


CONFLICT 


197 


197 


S -> 


H (in Ref. 3) . 


CONFLICT 


213 


213 


I -> 


L (in Ref. 3) . 


CONFLICT 


216 


216 


H -> 


T (in Ref. 3) . 


CONFLICT 


232 


232 


L -> 


M (in Ref. 3) . 



Sequence information 

Length: 526 AA [This is the 
length of the unprocessed 
precursor] 

10 . 20 



Molecular weight: 59733 Da 
[This is the MW of the 
unprocessed precursor] 

30 40 



2) 



CRC64: E151372FF6C95BE7 [This 
is a checksum on the sequence] 



50 



60 



MliRRKVSTLL MTALITTSFL NSKPVYANPV TKSKDNNLKE VQQVTSKSNK NKNQKVTIMY 

70 80 90 100 110 120 

YCDADNNLEG SLLNDIEEMK TGYKDSPNLN LIALVDRSPR YSSDEKVLGE DFSDTRLYKI 



130 140 150 160 170 180 

EHNKANRLDG KNEFPEISTT SKYEANMGDP EVLKKFIDYC KSNYEADKYV LIMANHGGGA 

190 200 210 220 230 240 

REKSNPRLNR AICWDDSNLD KNGEADCLYM GEISDHLTEK QSVDLLAFDA CLMGTAEVAY 

250 260 270 280 290 300^ 

QYRPGNGGFS ADTLVASSPV VWGPGFKYDK IFDRIKAGGG TNNEDDLTLG GKEQNFDPAT 

310 320 330 340 350 360 

ITNEQLGALF VEEQRDSTHA NGRYDQHLSF YDLKKAESVK RAIDNLAVNL SNENKKSEIE 

370 380 390 400 410 420 

KLRGSGIHTD LMHYFDEYSE GEWVEYPYFD VYDLCEKINK SENFSSKTKD LASNAMNKLN 

430 440 450 460 470 480 

EMIVYSFGDP SNNFKEGKNG LSIFLPNGDK KYSTYYTSTK IPHWTMQSWY NSIDTVKYGL 



490 500 510 520 

NPYGKLSWCK DGQDPEINKV GNWFELLDSW FDKTNDVTGG VNHYQW P09870 in FASTA 

format 



View entry in original UniProtKB/Swiss-Prot format 
View entry in raw text format (no links) 

Report form for errors/updates in this UniProtKB/Swiss-Prot entry 



BLAST submission on 
BLAST ExPASy/SIB 

oratNCBI (USA) 



ScanProsite, MotifScan 




NIESI® 



NPSA Sequence analysis 
tools 



Sequence analysis tools: ProtParam, ProtScale, 
^^§§ Compute pI/Mw, PeptideMass, PeptideCutter, 
Dotlet (Java) 



Submit a homology modeling request to SWISS- 
MODEL 




http://www.expasy.org/uniprot/P09870 
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uropean Bioi nf ormati cs Institute 

wmmmtmmmm mmMMA ssi: . . i 



_ „ - 



Text Entry I EmblEntry 



^ Reset ^ 



^ Previous Entry J Entry 2 of 2 from Query 1 



Entry 
Information 

Entry from: EMBL 

A Ik 

Entry Options 

Launch analysis tool: 



iBIastN 



Launch j 



Link to related 
information: 



Link 



Save entry: T Save 



View: 

^ Printer Friendly J 



Goto: General Description References Additional Sequence 

General Information 

Primary Accession # X63673 



Accession # 

Entry Name 

Molecule Type 

Sequence Length 

Entry Division 

Sequence Version 

Creation Date 

Modification Date 

Description 

Description 

Keywords 

Organism 

Organism 
Classification 

References 



X63673 

EMBL:CHCLOSI 
genomic DNA 
2768 
PRO 

X63673.1 

21-FEB-1992 

19-JUL-1993 

C.histolyticum closl gene for alpha-clostripain 
closl gene; Clostripain.; 
Clostridium histolyticum 

Bacteria; Firmicutes; Clostridia; Clostridiales; Clostridiace* 



1. Diefenthal,T.; 

Submitted (28-JAN-1992) to the EMBL/GenBank/DDBJ dat 
Research, Dept. Biotechnology, Schaarstr.l, P.O.B. 20 65, 

Position 1-2368 

2. Dargatz,H.; Diefenthal,T.; Witte,V.; Reipen,G.; von Wettst 
The heterodimeric protease clostripain from Clostrid 
single gene 

Mol. Gen. Genet. 240(1):140-145 (1993) 

DOI 10.1007/BF00276893 

Pubmed 8341259 

Position 1-2368 
Additional Information 
Features 



http://srs.ebi.ac.uk/srsb^ 10/27/05 
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Key Location 
source 1..2768 



-35 signal 376.. 381 

-10 signal 397.. 402 

repeat region 420..438 

rbs 605..609 

Cds 618.. 2198 



Sequence 

Characteristics 
Sequence 



Qualifier Value 

db_xref taxon:1498 
mol_type genomic DNA 
organism Clostridium histolyti 



note palindrom-structure 
rpt_type INVERTED 



db_xref 
db_xref 
db — xref 
translatable 
gene 
EC_n umber 
product 
protein_id 
translation 



GOA:P09870 
InterPro:IPR0050' 
UniProtKB/Swiss-l 
11 

ClosI 
3.4.22.8 
clostripain 
CAA45212.1 



>CAA45212 

MLRRKVSTLLMTAL] 

YCDADNNLEGSLLNE 

EHNKANRLDGKNEFI 

REKSNPRLNRAICWE 



Length: 2768 BP, A Count: 1084, C Count:359, G Count 



http://srs.ebi.ac.uk/srsW^ 
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>embl 1X63673 | CHCLOSI C . histolyticum closl ge 
gctataatcaatgccatatatataaagattaaagtaatcttttc 
agtatcagctttattaattaattgcttttttaaaaaattatttc 
tctactaaatagttaaattcagcttttcatttttccatcactac 
ttatatcatagaatgcttttaccacgtaatataaaaagcaagac 
tgaacctaaagacctaatgtagatttttaacaaaacaaatgatt 
taacagttcagataacttcctttttgagattgtaatatataagt 
aaatataccccccgattgtaatgcttgaatattaattataatga 
aaaaaaataatttttttatgaagtttaagaaggattatcctaaa 
aaatataaaaataatgtaaaactttaaataataactcttataat 
atatggattaaattttaatttatagatgctataaaactataaat 
ttttggaggtatgtattatgttaagaagaaaagtatcaacacta 
taactacttcatttttaaattccaaacccgtatatgcaaatccc 
ataataacttaaaagaagtacaacaagttacaagcaagagtaat 
aagtaactattatgtactattgcgacgcagacaataacttggac 
atatcgaggaaatgaaaacaggatataaggatagtcctaatttc 
tagacagatccccaagatatagcagtgacgaaaaagttttaggt 
cacgtctttataagattgaacacaataaggcaaatagattagac 
cagaaataagtactactagtaaatatgaagctaacatgggggat 
aatttattgattattgtaaatctaattatgaggctgataaatat 
atcatggtggtggtgcaagggaaaaatcaaatccaagattaaat 
atgatagtaaccttgataaaaatggtgaagcagactgcctttat 
atcatttaacagaaaaacaatcagttgatttacttgcctttgat 
ctgcagaagtagcgtatcagtatagaccaggtaatggaggattt 
ttgcttcaagcccagtagtttggggtcctggattcaaatatgat 
taaaagctggtggaggaactaataatgaggatgatt-taacttte 
actttgatcctgcaaccattaccaatgagcaattaggtgcatta 
gagactcaacacatgccaatggtcgctatgatcaacacttaagc 
aagctgaatcagtaaaaagagccatagataatttagctgttaat 
aaaaatctgaaattgaaaaattaagaggaagtggaattcataca 
tcgatgaatattctgaaggagaatgggttgaatatccttatttt 
gtgaaaaaataaataaaagtgaaaattttagtagtaaaactaaa 
ctatgaataaattaaatgaaatgatagtttattcttttggagac 
aagaaggaaaaaatggattgagtatattcttacctaatggagat 
attatacatcaaccaagatacctcattggactatgcaaagttgc 
cagttaaatatggattgaatccttacggaaaattaagttggtgt 
ctgaaataaataaagttggaaattggtttgaacttctagattct 
atgatgtaactggaggagttaatcattaccaatggtaaaactca 
attgtaatttataaaatttatattcaactaaaaacttctgcctc 
tttctaaattgaggcaatattttaataattcattcctttatttc 
aaactgctttttgtctagataaacaatagatgctccaacaatta 
agaatatacagataaagcttctccaaataaaataaatccccaac 
cctatatattacaacttctacaattactgcattagcatttccga 
tatttgtgctaataaggaaacggcacctaacaagattaacaggc 
tggtattacaaagttatttctcatcaatactccagcaactatac 
ataaaacactatttcgtaatgatgatgtttcttagactttgcta 
tgaagcacttgccgataaaactgcacttaacatccaaagatggc 
gagtatct 



m 
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CHCLOSI_4; parent: CHCLQSI 
X63673; 

repeat_region 420.. 438 

/note="palindrom-structure" 
/ r p t_t ype = I N VE RT E D 

Sequence 19 BP; 

taaaaaaata attttttta 
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ID CLOS_CLOHI STANDARD; PRT; 526 AA. 

AC P09870; P09869; 

% DT 01-MAR-1989 (Rel. 10, Created) 

DT 01-JUL-1993 (Rel. 26, Last sequence update) 

DT 24-JAN-2006 (Rel. 49, Last annotation update) 

DE Alpha-clostripain precursor (EC 3.4.22.8 ) (Clostridiopeptidase B) 

DE [Contains: Alpha-clostripain light chain; Alpha-clostripain heavy 

DE chain] . 

GN Name=cloSI; 

OS Clostridium histolyticum. 

OC Bacteria; Firmicutes; Clostridia; Clostridiales ; Clostridiaceae ; 

OC Clostridium. 

OX NCBI_TaxID=14_98; 

RN [1] 

RP NUCLEOTIDE SEQUENCE [GENOMIC DNA] . 

RX MEDLINE= 93341452 ; PubMed= 8341259 ; DOI= 10 . 1007 /BF0027 6893 ; 

RA Dargatz H. , Diefenthal T. , Witte V., Reipen G. , von Wettstein D.; 

RT "The heterodimeric protease clostripain from Clostridium histolyticum 

RT is encoded by a single gene."; 

RL Mol. Gen. Genet. 240:140-145(1993). 

RN [2] 

RP PROTEIN SEQUENCE OF 51-181. 

RX MEDLINE= 85076641; PubMed= 6391922 ; 

RA Gilles A.M., Lecroisey A., Keil B.; 

RT "Primary structure of alpha-clostripain light chain."; 

RL Eur. J. Biochem. 145:469-476(1984). 

RN [3] 

RP PRELIMINARY PROTEIN SEQUENCE OF 51-73 AND 191-232. 

RX MEDLINE= 83131688 ; PubMed= 6337850 ; 

RA Gilles A.M., de Wolf A., Keil B.; 

RT "Amino-acid sequences of the active -site sulfhydryl peptide and other 

RT thiol peptides from the cysteine proteinase alpha-clostripain."; 

RL Eur. J. Biochem. 130:473-479(1983). 

CC -!- FUNCTION: Cysteine endopeptidase with strict specificity. 

CC -!- CATALYTIC ACTIVITY: Preferential cleavage: Arg- | -Xaa, including 

CC Arg- | -Pro bond, but not Lys-|-Xaa. 

CC -!- SUBUNIT: Heterodimer of a light chain and an heavy chain held 

CC together by strong noncovalent forces rather than by 

CC intramolecular disulfide bridges. 

CC -!- SIMILARITY: Belongs to the peptidase Cll family. 

CC - ! - DATABASE : NAME = Worthing ton enzyme manual ; 

CC WWW= "ht tp : / /www . worthington-biochem . com/ CP/ " . 

CC 

CC This Swiss -Prot entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use as long as its content is in no way modified and this statement is not 

CC removed. 

CC 

DR EMBL; X63673 ; CAA45212 .1; -; Genomic_DNA. 

DR PIR; A29174 ; A29174. 

DR PIR; A29175; A29175. 

DR PIR; B29175 ; B29175. 

DR PIR; S35190 ; S35190. 

DR MEROPS; Cll. 001 ; -. 

DR InterPro; IPR005077 ; Peptidase_Cll . 

DR. Pfam; PF03415 ; Peptidase Cll ; 1. 

KW Direct protein sequencing ; Hydrolase ; Protease ; Signal ; 

KW Thiol protease ; Zymogen . 

FT SIGNAL 1 27 Potential. 
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FT PROPEP 28 50 Potential. 

FT /FTId= PR0 0000028511 . 

m FT CHAIN 51 181 Alpha-clostripain light chain. 

*FT /FTId= PRO 0000028512 . 

FT PROPEP 182 190 Linker. 

FT /FTId= PRO 0000028513 . 

FT CHAIN 191 526 Alpha-clostripain heavy chain. 

FT /FTId= PRO 0000028514 . 

FT ACT SITE 231 231 

FT CONFLICT 127 127 R -> NQL (in Ref. 2). 

FT CONFLICT 176 17 9 HGGG -> GDGH (in Ref. 2) . 

FT CONFLICT 197 197 S -> H (in Ref. 3). 

FT CONFLICT 213 213 I -> L (in Ref. 3) . 

FT CONFLICT 216 216 H -> T (in Ref. 3). 

FT CONFLICT 232 232 L -> M (in Ref. 3). 

SQ SEQUENCE 526 AA; 59733 MW; E1513 72FF6C95BE7 CRC64 ; 

MLRRKVSTLL MTALITTSFL NSKPVYANPV TKSKDNNLKE VQQVTSKSNK NKNQKVTIMY 
YCDADNNLEG SLLNDIEEMK TGYKDSPNLN LIALVDRSPR YSSDEKVLGE DFSDTRLYKI 
EHNKANRLDG KNEFPEISTT SKYEANMGDP EVLKKFIDYC KSNYEADKYV LIMANHGGGA 
REKSNPRLNR AICWDDSNLD KNGEADCLYM GEISDHLTEK QSVDLLAFDA CLMGTAEVAY 
QYRPGNGGFS ADTLVASSPV VWGPGFKYDK IFDRIKAGGG TNNEDDLTLG GKEQNFDPAT 
ITNEQLGALF VEEQRDSTHA NGRYDQHLSF YDLKKAESVK RAIDNLAVNL SNENKKSEIE 
KLRGSGIHTD LMHYFDEYSE GEWVEYPYFD VYDLCEKINK SENFSSKTKD LASNAMNKLN 
EMIVYSFGDP SNNFKEGKNG LSIFLPNGDK KYSTYYTSTK IPHWTMQSWY NSIDTVKYGL 
NPYGKLSWCK DGQDPEINKV GNWFELLDSW FDKTNDVTGG VNHYQW 

// 

<entry accession="UPI0000127ADB"> 
<dbRef erenceList> 

<dbReference db="EMBL" id="CAA45212 " version="l" version_i="l" active="Y" create 
<dbReference db="SWISSPROT n id="P09870" version_i="l" active="Y" created="29-Mar ^ 
<dbReference db="PIRARC" id="S35190" version_i= " 1 " active="Y M created="31-Mar-20 
<dbReference db= n PIR" id="S35190" version_i="l" active="Y" created= M ll-Apr-2003" 
</dbRef erenceList> 

<sequence length= "526 " crc64= "E151372FF6C95BE7 " > 

MLRRKVS TLLMTAL I TTS FLNS KPVYANPVTKS KDNNL KEVQQVTS KSNKNKNQKVT I MY 

YCDADNNLEGSLLNDIEEMKTGYKDSPNLNLIALVDRSPRYSSDEKVLGEDFSDTRLYKI 

EHNKANRLDGKNEFPEISTTSKYEANMGDPEVLKKFIDYCKSNYEADKYVLIMANHGGGA 

REKSNPRLNRAICWDDSNLDKNGEADCLYMGEISDHLTEKQSVDLLAFDACLMGTAEVAY 

QYRPGNGGFSADTLVAS S P WWGPGFKYDKI FDRI KAGGGTNNEDDLTLGGKEQNFDPAT 

ITNEQLGALFVEEQRDSTHANGRYDQHLSFYDLKKAESVKRAIDNLAVNLSNENKKSEIE 

KLRGSGIHTDLMHYFDEYSEGEWEYPYFDVYDLCEKINKSENFSSKTKDLASNAMNKLN 

EMI VYSFGDPSNNFKEGKNGLS IFLPNGDKKYSTYYTSTKI PHWTMQSWYNS IDTVKYGL 

NPYGKLSWCKDGQDPEINKVGNWFELLDSWFDKTNDVTGGVNHYQW 

</sequence> 

</entry> 
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: Launch ) 
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FT 
FT 
SQ 



CHCLOSIJL; 

X63673; 

source 



Sequence 2 
gctataatca 
^gtatcagct 
' tctactaaat 
t ttatatcata 
tgaacctaaa 
^taacagttca 
aaatataccc 
aaaaaaataa 
aaatataaaa 
^atatggatta 
ttttggaggt 
taactacttc 
ataataactt 
aagtaactat 
atatcgagga 
tagacagatc 
cacgtcttta 
cagaaataag 
aatttattga 
atcatggtgg 
atgatagtaa 
atcatttaac 
ctgcagaagt 
ttgcttcaag 
taaaagctgg 
actttgatcc 
gagactcaac 
aagctgaatc 
aaaaatctga 
tcgatgaata 
gtgaaaaaat 
ctatgaataa 
aagaaggaaa 
attatacatc 
cagttaaata 
ctgaaataaa 
atgatgtaac 
attgtaattt 
tttctaaatt 
aaactgcttt 
agaatataca 
cctatatatt 



parent: CHCLOSI 
1. .2768 

/db_xref^" taxon: 14 98 " 
/mol_type=" genomic DNA" 
/organism^" Clostridium histolyticum" 
768 BP; 

atgccatata tataaagatt aaagtaatct tttgtatc 
ttattaatta attgcttttt taaaaaatta tttgtaag 
agttaaattc agcttttcat ttttccatca ctaccata 
gaatgctttt accacgtaat ataaaaagca agagtcct 
gacctaatgt agatttttaa caaaacaaat gattgatc 
gataacttcc tttttgagat tgtaatatat aagtaaat 
cccgattgta atgcttgaat attaattata atgaaatt 
tttttttatg aagtttaaga aggattatcc taaacatt 
ataatgtaaa actttaaata ataactctta taatggta 
aattttaatt tatagatgct ataaaactat aaattaaa 
atgtattatg ttaagaagaa aagtatcaac actattaa 
atttttaaat tccaaacccg tatatgcaaa tccagtaa 
aaaagaaota caacaagtta caJTgfeaagag taataaaa 
tatgtacfff^t tgcgacgcag acaataactt ggaaggaa 
aatgaaaaca ggatataagg atagtcctaa tttaaatt 
cccaagatat agcagtgacg aaaaagtttt aggtgaag 
taagattgaa cacaataagg caaatagatt agacggta 
tactactagt aaatatgaag ctaacatggg ggatcctg 
ttattgtaaa tctaattatg aggctgataa atatgtgc 
tggtgcaagg gaaaaatcaa atccaagatt aaatagag 
ccttgataaa aatggtgaag cagactgcct ttatatgg 
agaaaaacaa tcagttgatt tacttgcctt tgatgcat 
agcgtatcag tatagaccag gtaatggagg attttctg 
cccagtagtt tggggtcctg gattcaaata tgataaga 
tggaggaact aataatgagg atgatttaac tttaggtg 
tgcaaccatt accaatgagc aattaggtgc attatttg 
acatgccaat ggtcgctatg atcaacactt aagctttt 
agtaaaaaga gccatagata atttagctgt taatctaa 
aattgaaaaa ttaagaggaa gtggaattca tacagatt 
ttctgaagga gaatgggttg aatatcctta ttttgacg 
aaataaaagt gaaaatttta gtagtaaaac taaagatt 
attaaatgaa atgatagttt attcttttgg agacccta 
aaatggattg agtatattct tacctaatgg agataaaa 
aaccaagata cctcattgga ctatgcaaag ttggtata 
tggattgaat ccttacggaa aattaagttg gtgtaaag 
taaagttgga aattggtttg aacttctaga ttcttggt 
tggaggagtt aatcattacc aatggtaaaa ctcaatat 
ataaaattta tattcaacta aaaacttctg cctcaatc 
gaggcaatat tttaataatt cattccttta tttcaact 
ttgtctagat aaacaataga tgctccaaca attagtat 
gataaagctt ctccaaataa aataaatccc caagcgga 
acaacttcta caattactgc attagcattt ccgaaagc 
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tatttgtgct aataaggaaa cggcacctaa caagattaac aggattaa 
tggtattaca aagttatttc tcatcaatac tccagcaact atagtaga 
ataaaacact atttcgtaat gatgatgttt cttagacttt gctaattt 
tgaagcactt gccgataaaa ctgcacttaa catccaaaga tggcatag 
gagtatct 
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ID L77965 standard; genomic DNA; PRO; 1392 BP. 

XX 

AC L77965; 
XX 

SV L77965.1 
XX 

DT 28-JUL-1998 (Rel . 56, Created) 

DT 29-JUL-1998 (Rel. 56, Last updated, Version 2) 
XX 

DE Clostridium perfringens C beta 2 toxin gene, complete cds. 

XX 

KW 

XX 

OS Clostridium perfringens C 

OC Bacteria; Firmicutes; Clostridia; Clostridiales ; Clostridiaceae; 

OC Clostridium. 

XX 

RN [1] 
RP 1-1392 

RX DOI; 10. 1016/S0378-1119 (97) 004 93-9 
RX ^PUBMEDy ^ 426008 . 

Giber_t^ ,M.^> Jolivet-Reynaud C, Popoff M.R.; 
RT* , *^"" ; Beta2 toxin, a novel toxin produced by Clostridium perfringens"; 
RL Gene 203 (1) :65-73 (1997) . 
XX 

RN [2] 

RP 1-1392 

RA Popoff M.R. ; 

RT 

RL Submitted (15-JAN-1998) to the EMBL/ GenBank/DDB J databases. 

RL Toxines Microbiennes , Institut Pasteur, Paris cedex 15 75724, France 

XX 

CC GSDB:S: 76036. 

CC [Flatfile retrieved from GSDB Fri Jul 24 15:39:17 1998]. 
XX 

FH Key Location/Qualifiers 
FH 

FT source 1..1392 

FT /db xref= taxon: 79668 " 

FT /mol_type= n genomic DNA" 

FT /organism="Clostridium perfringens C" 

FT /strain="CWC245" 

FT RBS 255.. 260 

FT /note=" putative" 

FT sigpeptide 268.. 357 

FT /note= M putative M 

FT CDS 268.. 1065 

FT /codon_ start=l 

FT /db xref= TJniProtKB/TrEMBL: 086264 " 

FT /note=" putative" 

FT /transl_table=ll 

FT /product="beta 2 toxin" 

FT /protein id= AAC27654 . 1" 
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FT / trans 1 at ion = " MKKI ISKFTVI FMFSCFLI VGAI SPMKASAKEIDAYRKVMENYLN 

FT ALKNYDINTVWISEDERVNNVEQYREMLEDFKYDPNQQLKSFEII^SQKSDNKEIFOT 

FT KTEFLNGAIYDMEFWSSKDGKLIVSDMERTKVENEGKYILTPSFRTQVCTWDDELAQA 

FT IGGWPQTYSDRFTYYADNILLNFRQYATSGSRDLKVEYSVVDHW^KDDVKASQMVYG 

FT QNPD SARQ I RLY I E KGQS FYKYR I RI KNFTPAS I RVFGEGYCA " 

FT mat peptide 358 . . 1062 

FT /note= 11 putative" 

FT /product = " beta 2 toxin" 

FT terminator 1066.. 1104 

FT /note=" putative" 

XX 

SQ Sequence 1392 BP; 606 A; 115 C; 209 G; 462 T; 0 other; 

atttgggata tcttaaattt agcacagaag aatgtttaaa tgaaataaag ataataaaaa 60 

gatatattaa ttatatagct gaaaatttat aattatatga taagtatagt taataaataa 120 

aaagtgttct cgggggacac ttttttgttt taaaaaggaa aatataaata aaatttagat 180 

aaaagtgtaa aataattatt tttattttaa atttgttaaa aatttgatat aattgaattg 240 

taaaaaaaat ttcagggggg aatataaatg aaaaaaatta tttcaaagtt tactgtaatt 300 

tttatgtttt catgttttct tattgttgga gcaataagtc caatgaaagc aagtgcaaaa 360 
gaaatcgacg cttatagaaa ggtaatggag aattatctta atgctttaaa aaactacgat - 420 

attaatacag ttgtaaacat ttcagaagat gaaagagtaa ataatgttga acagtataga 480 

gaaatgttag aagattttaa atatgatcct aaccaacaac tgaaatcttt tgaaatactt 54 0 

aattcacaaa agagcgataa taaagaaata tttaatgtaa aaactgaatt tttaaatggt 600 

gcaatttatg atatggaatt tactgtatca tctaaagatg gaaaattaat agtatctgat 660 

atggaaagaa caaaagttga gaatgaagga aaatatattt taacaccatc atttagaact 720 

caagtttgta catgggatga tgaactagca caagcaattg ggggagttta tccacaaaca 780 

tattctgata gatttacata ttatgcagat aatatattat taaacttcag acaatatgca 840 

acttcaggtt caagagattt aaaagtagaa tatagtgttg tagatcattg gatgtggaaa 900 

gatgatgtta aagcttctca aatggtatat ggtcaaaatc ctgattctgc tagacaaata 960 

agattatata tagaaaaagg acaatctttc tataaatata gaataagaat taaaaacttt 1020 

acacctgcat caattagagt atttggtgaa gggtattgtg catagaaaaa aatatgaagt 1080 

gacttagtca cttcatattt tttttactat taattttatt atataaaaac ctaacataca 1140 

tgaaagtatt cttaatacag ttatatcaaa attaaagtag gggaaataaa ataaaaggct 1200 

aaaaactata ttaaaaacta taaaaattat taaattaggt tttaaggtgt tatatttatt 1260 

tatgattata ggaataaata tgccaaatgg aataaataaa agtaatatta ataattggtc 1320 

taaaaagtat acatcattga taaaagaaaa attaccagta aaaattgagc ttaaaaaatt 13 80 

aaatgtaaat tt 1392 

// 
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